Baseball Scholar — MLB Player Statistics Dataset
1901–Present

DESCRIPTION
-----------
This dataset contains individual Major League Baseball player statistics for every season from 1901 to the present. Each row represents a single player season and includes standard batting, pitching, and fielding positions, depending on the file.

The dataset is provided for historical research, analysis, and educational use.

CONTENTS
--------
This ZIP archive contains the following file(s):

- the-baseball-scholar-mlb-player-stats-1901-present.csv
- the-baseball-scholar-mlb-player-stats-1901-present.xlsx
- the-baseball-scholar-batting-stats-1901-present.csv
- the-baseball-scholar-pitching-stats-1901-present.csv
  (or batting/pitching-only file, depending on archive)

FILE FORMAT
-----------
CSV files are comma-delimited text files compatible with Excel, Google Sheets, Python, R, and most statistical software. Excel (.xlsx) files are provided in native Microsoft Excel format.

DATA COVERAGE
-------------
Seasons: 1901–Present  
Level: Major League Baseball  
Granularity: One row per player per season  

DATA SOURCES
------------
This dataset is compiled from publicly available historical Major League Baseball records. The data has been standardized to ensure consistent formatting across eras.

METHODOLOGY & LIMITATIONS
-------------------------
- Statistical definitions reflect the official scoring rules in effect during each season.
- Rule changes, season lengths, and historical record-keeping practices may affect cross-era comparisons.
- This dataset is intended for analytical use and does not attempt to retroactively normalize statistics.

Users are encouraged to account for historical context when performing analysis.

UPDATES
-------
The dataset is updated periodically as new seasons conclude and corrections are identified.

ATTRIBUTION
-----------
If you use this dataset in published work, articles, or public projects, attribution to "The Baseball Scholar" is appreciated.

Website:
https://thebaseballscholar.com/data-library

LICENSE
-------
This dataset is provided free of charge for educational and research purposes.
